Pixel Difference Convolutional Network for RGB-D Semantic Segmentation

Abstract

RGB-D semantic segmentation can be advanced with convolutional neural networks (CNNs) due to the availability of Depth data. Although objects cannot be easily discriminated by 2D appearance alone, with the local pixel difference and geometric patterns in Depth they can be well separated in some cases. Because of their fixed grid kernel structure, CNNs lack the ability to capture detailed, fine-grained information and thus cannot achieve accurate pixel-level semantic segmentation. To solve this problem, we propose a Pixel Difference Convolutional Network (PDCNet) that captures detailed intrinsic patterns by aggregating both intensity and gradient information in the local range for Depth data and in the global range for RGB data, respectively. Precisely, PDCNet consists of a Depth branch and an RGB branch. For the Depth branch, we propose a Pixel Difference Convolution (PDC) to consider local and detailed geometric information in Depth data via aggregating both intensity and gradient information. For the RGB branch, we contribute a lightweight Cascade Large Kernel (CLK) to extend PDC, namely CPDC, to enjoy global contexts for RGB data and further boost performance. Consequently, both local and global pixel differences from the two-modal data are seamlessly incorporated into PDCNet during the information propagation process. Experiments on three challenging benchmark datasets, i.e., NYUDv2 (78.4 Acc., 53.5 mIoU), SUN RGB-D (83.3 Acc., 49.6 mIoU), and SID Dataset (83.1 Acc., 61.4 mIoU), reveal that our PDCNet achieves state-of-the-art performance for the semantic segmentation task.
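The abstract does not give the PDC formula, so the following is only a minimal sketch of one common way to aggregate an intensity term and a gradient (pixel difference) term in a single convolution. The function name `pixel_difference_conv`, the mixing weight `theta`, and the blending rule are assumptions made for illustration and may differ from the authors' formulation.

```python
def pixel_difference_conv(image, kernel, theta=0.7):
    """Hypothetical sketch of a pixel-difference convolution.

    For each output location p0 and kernel offsets p_n with weights w_n:
        intensity = sum_n w_n * x(p0 + p_n)
        gradient  = sum_n w_n * (x(p0 + p_n) - x(p0))
        out(p0)   = (1 - theta) * intensity + theta * gradient
    `theta` is an assumed mixing weight, not a value taken from the paper.
    The kernel is assumed square with odd size; no padding is applied.
    """
    k = len(kernel)
    r = k // 2
    h, w = len(image), len(image[0])
    out = [[0.0] * (w - 2 * r) for _ in range(h - 2 * r)]
    for i in range(r, h - r):
        for j in range(r, w - r):
            center = image[i][j]
            intensity = 0.0
            gradient = 0.0
            for di in range(k):
                for dj in range(k):
                    x = image[i + di - r][j + dj - r]
                    wt = kernel[di][dj]
                    intensity += wt * x             # plain convolution term
                    gradient += wt * (x - center)   # difference to the center pixel
            out[i - r][j - r] = (1 - theta) * intensity + theta * gradient
    return out


# A flat depth patch has no local difference, so the gradient term vanishes.
flat_patch = [[2.0, 2.0, 2.0] for _ in range(3)]
# A patch with a depth step on its right column produces a non-zero gradient term.
step_patch = [[1.0, 1.0, 5.0] for _ in range(3)]
ones_kernel = [[1.0, 1.0, 1.0] for _ in range(3)]
```

With `theta=0.0` the sketch reduces to an ordinary convolution, and with `theta=1.0` it responds only to differences from the center pixel, which is why a flat patch yields zero while a depth step does not.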

Related resources

Dense RGB-D semantic mapping with Pixel-Voxel neural network

For intelligent robotics applications, extending 3D mapping to 3D semantic mapping enables robots not only to localize themselves with respect to the scene's geometrical features but also to simultaneously understand the higher-level meaning of the scene contexts. Most previous methods focus on geometric 3D reconstruction and scene understanding independently, notwithstanding the fact that joint e...

Training Bit Fully Convolutional Network for Fast Semantic Segmentation

Fully convolutional neural networks give accurate, per-pixel prediction for input images and have applications like semantic segmentation. However, a typical FCN usually requires lots of floating point computation and large run-time memory, which effectively limits its usability. We propose a method to train Bit Fully Convolution Network (BFCN), a fully convolutional neural network that has low...

Depth-aware CNN for RGB-D Segmentation

Convolutional neural networks (CNN) are limited by the lack of capability to handle geometric information due to the fixed grid kernel structure. The availability of depth data enables progress in RGB-D semantic segmentation with CNNs. State-of-the-art methods either use depth as additional images or process spatial information in 3D volumes or point clouds. These methods suffer from high compu...

Per-Pixel Feedback for improving Semantic Segmentation

Semantic segmentation is the task of assigning a label to each pixel in the image. In recent years, deep convolutional neural networks (DCNNs) have been driving advances in multiple tasks related to cognition. Although DCNNs have resulted in unprecedented visual recognition performance, they offer little transparency. To understand how DCNN-based models work at the task of semantic segmentation, we tr...

A hierarchical Convolutional Neural Network for Segmentation of Stroke Lesion in 3D Brain MRI

Introduction: Brain tumors such as glioma are among the most aggressive lesions, which result in a very short life expectancy in patients. Image segmentation is highly essential in medical image analysis with applications, particularly in clinical practices to treat brain tumors. Accurate segmentation of magnetic resonance data is crucial for diagnostic purposes, planning surgical treatments, a...

Journal

Journal title: IEEE Transactions on Circuits and Systems for Video Technology

Year: 2023

ISSN: 1051-8215, 1558-2205

DOI: https://doi.org/10.1109/tcsvt.2023.3296162